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We develop a new test of local bias, by constructing a locally biased halo density field from 
sampling the dark matter-halo distribution. Our test differs from conventional tests in that it 
preserves the full scatter in the bias relation and it does not rely on perturbation theory. We put 
forward that bias parameters obtained using a smoothing scale R can only be applied to computing 
the halo power spectrum at scales k ~ 1 /R. Our calculations can automatically include the running 
of bias parameters and give vanishingly small loop corrections at low-fc. Our proposal results in much 
better agreement of the sampling and perturbation theory results with simulations. In particular, 
unlike the standard interpretation of local bias in the literature, our treatment of local bias does 
not generate a constant power in the low-fc limit. We search for extra noise in the Poisson corrected 
halo power spectrum at wavenumbers below its turn-over and find no evidence of significant positive 
noise (as predicted by the standard interpretation) while we find evidence of negative noise coming 
from halo exclusion for very massive halos. Using perturbation theory and our non-perturbative 
sampling technique we also demonstrate that nonlocal bias effects discovered recently in simulations 
impact the power spectrum only at the few percent level in the weakly nonlinear regime. 



I. INTRODUCTION 

Modern galaxy surveys are wide and deep enough to 
allow us to compute the statistics such as the power spec- 
trum and bispectrum (or their real space counterparts). 
However, galaxies are known to be biased objects relative 
to the underlying dark matter [T], thus we must under- 
stand the bias relation in order to extract information on 
the dark matter, which contains cosmological informa- 
tion. As a first step towards understanding galaxy bias, 
we need to understand the bias relation for dark matter 
halos, where galaxies are thought to reside. There exists 
several prescriptions for the halo bias predicting the bias 
parameters from the initial conditions. These approaches 
assume some heuristic mechanisms for the identification 
of halos from the initial density field. The peak bias for- 
malism [IH2] assumes that galaxies form at the peaks of 
the initial density field. Another popular model is the 
peak-background split, in which the density field is split 
into long and short wavelength parts. The long wave- 
length perturbation modifies the background density and 
the short wavelength causes local galaxy formation [I] 0]- 
6 . All these approaches are valuable for understanding 
the physics of bias, when compared to simulations they 
work remarkably well, but are not accurate at percent 
level. For example, although it is found that about 70% 
of the well-resolved halos can be associated with peaks 
in the initial density field [7], a significant fraction of 
halos that do not correspond to peaks. Comparison of 
the linear bias from peak-background split calculations 
shows that there are still 5% discrepancies with ./V-body 
simulations at large scales [5]. 

A commonly adopted approach at large scales is to use 
the local biasing prescription 9 , i.e. the smoothed halo 
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density field S^r is assumed to be a local function of the 
smoothed dark matter density 5r 

°° h R 

Mx)=E| (M*))', (i) 

i=0 

where bf- is a free parameter. We note that this local 
prescription only holds for the smoothed density fields, 
in which the small scale modes have been removed. We 
should regard Eq. ([lj as a large-scale effective descrip- 
tion. In the bias parameters, we have used the super- 
script R to emphasize that the Eulerian bias parameters 
in general depend on the smoothing scale. This is the pre- 
scription adopted in interpreting modern survey data for 
two-point and in particular higher-order statistics jTDJ- 

As the local biasing prescription is the simplest well- 
motivated model, a number of works have been devoted 
to testing it [5] [T6 20 . In practice, the series is of- 
ten truncated at the quadratic order, sometimes at the 
cubic order. When being tested against simulations, a 
quadratic or cubic polynomial is fitted to the scatter plot 
between Sr and S^r to extract the bias parameters. Also 
in the test perturbation theory is often invoked to com- 
pute the correlators, such as the halo power spectrum. 
There are two peculiar features of the local biasing pre- 
scription that deserve further examination. The bias pa- 
rameters in the scatter between Sr and S^r vary when 
the smoothing scales changes [18 -20J. On these grounds 
this method is sometimes dismissed as an unreliable way 
to measure the bias parameters, e.g. [2D]. In addition, 
unless the calculation is stopped at the tree level, it is 
well-known that higher-order loop corrections give rise 
to large undesirable corrections even at large scales un- 
less an aggressive smoothing window is applied [2TJ [22] . 
Both of these problems are especially obvious when the 
smoothing scale is small. For small smoothing scales, 
however, one may need to worry about the accuracy of 
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fitting to the scatter plot using a simple polynomial, as 
the scatter in the scatter plot is generally large (eventu- 
ally signaling the breakdown of the local bias approxima- 
tion) . One may also worry that the large corrections to 
the tree-level results may indicate the breakdown of the 
perturbation method. 

In this paper, we develop a method, which does not 
suffer from these two potential issues. We perform a 
test without using an explicit parametrization, and so 
we effectively test local biasing without truncation. In 
fact, our test can keep all the information in the scatter 
plot. Furthermore, the correlators are computed numer- 
ically, and so we do not need to worry about the poten- 
tial breakdown of perturbation theory. The clean test 
that we develop here is complementary to the conven- 
tional tests, and helps to highlight that there is a real 
problem to be addressed in the standard formulation of 
local bias. We shall offer a new interpretation for com- 
puting the power spectrum using the local biasing pre- 
scription. Essentially we argue that the smoothing scales 
in the halo biasing prescriptions is a "knob" controlling 
the scales of the halo fluctuations that the biasing pre- 
scription measures. The bias parameters obtained with 
large smoothing window must be used to calculate large- 
scale predictions, and those with small smoothing scales 
should be used for small-scale statistics. This intuitive 
viewpoint does not seem to be widely appreciated (see 
however section V in [13]). This interpretation naturally 
accommodates the two "undesirable" effects of local bi- 
asing, the running of bias parameters in the scatter plot 
and the large loop corrections to tree-level results. 

In |24j we showed that under very general conditions 
that the local biasing prescription is not self-consistent, 
in the sense that even if galaxies are formed with local 
biasing initially, subsequent large scale gravitational evo- 
lution inevitably induces nonlocality (see also [25l \26\). 
The explicit form of the nonlocal terms up to third order 
were derived first assuming halo conservation, and then 
generalized to include a local source to take into account 
of the galaxy formation and merging. In both cases the 
structures of the nonlocal terms are the same. Ref. [23] 
also measures the specific nonlocal terms in the scatter 
plot and bispectrum (see also [27 ). Other arguments also 
lead to the existence of extra contributions to the biasing 
prescription. Ref. |28| proposes various correction terms 
using the invariants constructed from the physical fields. 
From Lagrangian perturbation theory, the halo bias is 
also known to be nonlocal [26] [29]. Here we develop a 
general test that enables us to investigate the importance 
of the nonlocality in a model-independent way. Because 
the effects of the extra nonlocal contributions is not large, 
it is vital to have a clean test. As we discussed above, 
our test is free of the the issues of standard tests, and 
this helps to pin down the effect of the extra contribu- 
tions. In this paper we focus on the halo power spectrum 
however, which turns out to be not effective in detecting 
the nonlocality. In a forthcoming paper, we will extend 
the analysis to the bispectrum. 



This paper is organized as follows. We describe the 



sampling technique in Section II A and the numerical re- 
sults are presented in Section |IIB As a comparison and 
to help interpret the sampling results, in Section |III| we 
present the results from standard Eulerian perturbation 
theory. We offer a new interpretation of Eulerian local 
biasing in Section [TV] which results in much better agree- 
ment between sampling and simulation results. As a case 
study, we study the pow er spe ctrum of a nonl ocal model 
derived in [24] in section VA In section VB we investi- 
gate how the results would be biased when a local model 
is assumed if the underlying model is nonlocal. We sum- 
marize and conclude in Section [VI] 



II. THE HALO SAMPLING METHOD 

In this section we first outline the halo sampling 
method and then present numerical results from it. 



A. Description of the sampling technique 

In this section we describe a method to construct a halo 
density field via sampling the dark matter-halo distribu- 
tion. We shall abbreviate this method as halo sampling 
or sampling. Here we outline the procedure. 

To begin with, let us suppose that we have the dark 
matter particles and halos from an iV-body simulation. 
First, we interpolate the dark matter field to a grid, and 
then Fourier transform it. The field is smoothed by ap- 
plying some window function. In this paper we shall use 
a top-hat window with radius R in real space through- 
out. After inverse Fourier transform, the smoothed dark 
matter field is obtained. The same procedure is carried 
out for the halos. At each grid point, we have the dark 
matter and halo density field pair, (5r, 5hn)i, where the 
index i denotes the coordinates of the grid. If we go on 
to make a scatter plot using these dark matter-halo pairs 
then this is the usual procedure to determine the bias 
parameters from the scatter plot. Instead, from these 
two smoothed fields, their joint probability distribution 
/(^Rj^hfl) is constructed. Note that the distribution / 
is a one-point distribution only. With the joint distribu- 
tion, for each grid point, given the dark matter field at 
that point 5r, we can construct a new halo field by draw- 
ing halo densities from the conditional probability distri- 
bution /(^hi?|<5i?.)- The new halo field is constructed by 
sampling the dark matter-halo distribution, that is why 
we called this method halo sampling. 

There is an easy way to carry out the sampling, and we 
shall dub it shuffling. We first sort (<5r, <5hii)i hi ascending 
order in Sr, and then bin them. The S^r components of 
these pairs are permuted within the same bin in Sr. The 
shuffled halo field is put back to the grid using its grid 
index i. Thus each shuffling procedure is a realization of 
the conditional distribution /(S^rISr). 
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Shuffling leaves the halo field unchanged if the halo 
field is solely determined by the dark matter density field. 
This shuffling procedure also leaves the S^r — Sr scatter 
plot unchanged. Thus we can test the local biasing pre- 
scription by reproducing the same scatter. If the biasing 
prescription has extra degrees of freedom other than Sr 
(and the stochastic noise [30]), the biasing prescription is 
called nonlocal. The presence of nonlocality will cause de- 
viation between the power spectrum of the original halo 
field and that of the sampled field. 

Suppose that the halo density is given by S^r — 
g(Sn, £), where £ collectively denotes the extra degrees of 
freedom, including the stochastic noise. When we con- 
struct the scatter plot between S^r and Sr, we project the 
function onto the £ = hypersurface. After projection, 
some information in the original relation is lost. Using 
the projected distribution to compute the statistics will 
typically lead to a biased answer. Because in our case we 
keep the information of the whole scatter plot, and we 
compute the correlators numerically, the shuffling results 
are probably the best that local biasing can ever achieve. 

As a further simplification, instead of using the whole 
distribution, we shall also replace S^r in each Sr bin by 
its mean. This simply reduces to the usual mean bias- 
ing prescription. We shall refer to this method as mean. 
Since we do not assume a specific functional form for 
the mean bias relation we effectively test Eq. (JlJ without 
truncation. However, it turns out that the differences be- 
tween the shuffling results and those using the mean are 
negligible because the shuffling operation also destroys 
any small-scale stochastic noise, including the Poisson 
shot noise. See Appendix | A 2| for more details. 



B. Numerical results of sampling 

Before presenting the numerical results, we shall give 
some details about the simulations used in this work. 
The A-body simulations consist of I280 3 particles, in 
a box of size 2400 Mpc/h, which is part of LasDamas 
project |31] ■ The cosmology is a flat ACDM model with 
n m = 0.25, fJ A = 0.75, and a s = 0.8. Thus the parti- 
cle mass is 4.57 x fO 11 M //i. The initial conditions are 
Gaussian, with a scale-invariant spectrum. The trans- 
fer function is output from CMBFAST [35]. The initial 
particle displacements are implemented using second or- 
der Lagrangian perturbation theory [33J at redshift 49. 
The simulations are done using the A-body code Gad- 
get2 [Mj. We shall average over fO realizations. 

The halos are identified using the FOF algorithm [35] 
with linking length of O.f 56 of the mean interparticle sep- 
aration. We consider halos with at least 20 particles. We 
shall focus on halos at two redshifts, 0.97 and respec- 
tively. At each redshift we further divide them into two 
groups, adjusting the boundary so that each group has 
the same number density, and hence similar shot noise 
properties. The mass and number density of the halo 
groups are shown in Table [Tj 



TABLE I. Properties of halo groups used in this paper 



halo 


redshift 


mass range 


mean v 


number density 


group 




(lO 12 M /ft) 




(Mpc/hy 3 


I 


0.97 


9.15 - 14.2 


1.97 


8.6 x 10~ 5 


II 


0.97 


greater than 14.2 


2.30 


8.6 x 10~ 5 


III 





9.15-16.5 


1.29 


2.0 x 10~ 4 


IV 





greater than 16.5 


1.61 


2.0 x 10" 4 



The dark matter particles and the halos are separately 
interpolated to grids of size 800 3 using a fourth-order 
interpolation kernel. After FFT, we apply a top-hat 
window to smooth both density fields. Then we inverse 
FFT to real space to get the density pair (Sr^^r). We 
use shuffling as the implementation of the sampling tech- 
nique. Thus we will sort the dark matter-halo pairs us- 
ing the value of the dark matter density. After sorting, 
we then bin the pairs into 40 bins, which are of equal 
spacing in Sr. The S^r components in each bin are then 
randomly permuted. When we implement the mean bias- 
ing instead of shuffling, the S^r components are replaced 
by their mean in that bin. Subsequently, the modified 
halo density field is put back to the grid, and the power 
spectrum is computed numerically. 

We now compare the power spectrum from the origi- 
nal field without smoothing with the one obtained from 
sampling the dark matter-halo distribution. For the most 
part we shall focus on the cross halo power spectrum to 
avoid dealing with shot noise subtraction, which may not 
be precisely Poisson. We will comment more on this in 
section IIVI 

In Fig. [T] we plot the bias parameter b c derived from 
the cross power spectrum between halo and dark matter 
P c and the dark matter power spectrum P m 



For the b c from simulations, the effects of the smoothing 
window cancel out neatly. This is because after multiply- 
ing the window functions, the smoothed fields are used 
to compute the power spectra directly. Therefore b c ob- 
tained using different smoothing scales is identical, and 
we only show one curve. We emphasize that changes in 
b c are equivalent to those in the halo power spectra since 
the dark matter field is not changed by the sampling pro- 
cedure. 

For small smoothing scale R, the b c from sampling is 
larger than that from the original simulation even at large 
scales. As the smoothing scale increases, the sampling 
result approaches the original simulation result at large 
scales. Among the four groups shown here, the sampled 
power spectrum is always larger than the original one 
when R is small. The sampled cross power spectrum 
agrees with the original one only when R is as large as 
50 Mpc//i. Except for the overall overshoot, b c from the 
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FIG. 1. The bias b c derived from the cross power spectrum obtained from the original simulation (lines with error bars) and 
those sampled by the shuffling method (dashed) and the mean method (dot-dashed). The panels correspond to the four halo 
groups used in this paper, whose properties are given in Table[I] Although five different smoothing scales are used, the window 
function in the cross power spectrum and the dark matter cancel out neatly in the case of the original simulation. The error 
bar is derived from the standard deviation of the cross power spectrum using f realizations. The results from shuffling almost 
completely overlap with those using the mean for each smoothing scale. The five smoothing scales are 8 16, 30, 50 and 100 
Mpc/h (from top to bottom at large scales in each subplot). For each curve, we show up to the scale at which it starts to 
rapidly oscillate due to the window function. The discrepancy at large scales will be addressed in detail in Section [rV] it does 
disappear when we properly interpret local bias: small smoothing scales are only relevant for the high-fe power spectrum. 



sampled field using small smoothing scales is scale in- 
dependent up to around k ~ 0.1/i/Mpc. We also note 
that the two sampling techniques result in almost iden- 
tical cross power spectra. The extra random noise in the 
shuffled field is averaged out when the halo field is crossed 
with the dark matter. 

The rest of the paper is mainly devoted to understand- 
ing the physical significance of the smoothing scales in the 
local biasing prescription, and in particular why there is 
such a discrepancy in the power spectrum at large scales. 
We will elaborate it in detail in Section |TV| However, the 
bottom line is rather intuitive. For each smoothing scale 
R the halo biasing prescription measures the fluctuations 
of the halo density field at that particular scale. The lo- 
cal biasing prescription with a large R characterizes the 
fluctuations of the halo density field at large scales, and 



so we are only entitled to compute the halo power spec- 
trum using that prescription for small k in the power 
spectrum. Similarly, a local biasing prescription obtained 
with a small R can only be used to compute the power 
spectrum at large k. 

First of all, we need to discuss the treatment of the 
shot noise. For the simulated halo power spectrum, we 
subtract the smoothed shot noise, which we assume to be 
the usual Poisson shot noise [35] except with the window 
function 

P ahotR (k) = j^^W 2 (kR), (3) 

where no is the mean number density of the halos. See 
Appendix | A l| for its derivation. For the halo groups used 
in this paper, their number density is given in Table [I] 
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FIG. 2. Same as Fig. [T] except for the bias t>h computed using the halo power spectrum. For the simulation halo power 
spectrum, shot noise is subtracted assuming Poisson statistics, while there is no shot noise subtraction for the sampled halos 
(see main text as to why this is a reasonable thing to do). 



We now turn to the halo power spectrum. In Fig. [5] 
where we show the bias derived from the halo (auto) 
power spectrum, that is 



(4) 



The shot noise is significant at around k ~ 0.1 — 
0.2/i/Mpc, at which the amount of noise subtracted be- 

). Furthcr- 
ue to halo 




comes comparable to the signal (see Fig. 13 



more, the shot noise may be sub-Poisson 
exclusion effects, which is relatively more important for 
massive halos [17] . There may be other types of stochas- 
tic noise as well. Complications in treating the shot noise 
will prevent us from comparing the theory with simula- 
tions in a precise way at 3> 0.1/i/Mpc for the halo 
power spectrum. Most of the work in this paper deals 
with the cross power spectrum for this reason. However 
we will come back to the halo power spectrum in sec- 



tion IV although we will concentrate there in the low-fc 
limit where shot noise issues are less of a concern. 

As mentioned, due to discreteness, the halo density 
field directly extracted from simulations has shot noise. 



If we construct the sampled halo field by drawing from a 
continuous dark matter-halo distribution or simply using 
the mean biasing relation, the correlation of the sampled 
field is endowed from the dark matter field, and thus it 
does not have shot noise. However, it is not so obvi- 
ous that the halo density field loses its shot noise after 
shuffling. The shot noise originates from the discrete na- 
ture of the halos. Heuristically, after smoothing, the ha- 
los spread out and shuffling further distributes the halo 
clouds everywhere, and so the correlations due to the dis- 
crete nature of halos is destroyed. In fact, any stochastic 
noise (not correlated with S) is in general is destroyed 
by the shuffling process (we have verified this experimen- 
tally by generating random particles and performing the 
shuffling process to the corresponding density field). See 
Appendix |A 2| for more details. Thus we do not sub- 
tract shot noise from the sampled halo power spectrum 
in Fig. [2j while we do subtract Poisson shot noise using 
Eq. (§ fr om the numerical power spectrum (where no 
shuffling has been done). 

However, if the Poisson shot noise is not accurate 
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enough or there are additional stochastic noises there 
will be discrepancy between the sampling results and the 
simulation results. Indeed, even for smoothing scales of 
100Mpc//i, there are some hints of percent-level devia- 
tions between the original power spectrum and the sam- 
pled one at large scales for halo group III (see bottom 
left panel in Fig. [2| . It is possible to interpret these devi- 
ations as a small extra white noise contribution (giving a 
total super- Poisson shot noise), as opposed to a slightly 
different bias, since the matter power spectrum does not 
vary that much over the range of scales k < 0.03/i/Mpc. 
From the cross spectrum in Fig. [T] we also see better 
agreement at large scales between the sampling and the 
original simulation results. However, note that these ha- 
los are not well resolved by our simulations. On the other 
hand, for groups II and IV the simulation results for the 
halo spectrum are slightly higher than those from sam- 
pling at large scales (an effect that is again not seen in 
the cross-power spectrum results). This might be due to 
small sub-Poisson contributions coming from halo exclu- 



sion. We'll come back to these points in Section IV 



The difference between the two implementations of 
sampling techniques (shuffling and mean) is that the one- 
point stochastic scatter about the mean is preserved in 
the shuffling method. When computing the cross power 
spectrum this stochastic noise does not contribute. But 
the stochastic noise may contribute a small amount of 
white noise to the halo auto power spectrum. However, 
we argue in Appendix |A 2| that the shuffling procedure 
destroys the contribution of this stochastic noise to the 
power spectrum. This agrees with what we see by com- 
paring the auto halo power spectrum from the shuffling 
and the mean methods. 

Note that in Fig. [2] is qualitatively similar to b c 
in Fig. [Tj in particular as the smoothing scale increases, 
6h from the sampled field decreases and approaches that 
from the original simulation. We note that the 6h and 
b c from sampling agree with each other at large scales 
for large smoothing scales such as 50Mpc//i (modulo the 
small differences pointed out above). This is an impor- 
tant point, and we will reiterate it after we re-interpret 
local biasing prescription. Because of the problem of shot 
noise subtraction in the original (pre-shuffled) simulation, 
in this paper we shall mainly rely on the cross power spec- 
trum when we compare results to simulations, except in 
Section [TV] where we address the low-A: regime of the auto 
power spectrum. 

In order to shed light on the shuffling results, in Sec- 
tion |III| we shall apply perturbation theory to the local 
model. We shall see that the large-scale power problem 
for a halo biasing prescription with a small smoothing 
scale already appears in the conventional test of local 
biasing prescriptions using perturbation theory. How- 
ever, since the problem is particularly obvious for small 
smoothing scales, and one may worry about the accu- 
racy of fitting the bias parameters in the scatter plot and 
the breakdown of perturbation theory when the variance 
is not small. The sampling technique introduced here 



evades these issues, and makes clear that this is a real 
problem of the standard formulation of the local biasing 
prescription. We will show that the seemingly chaotic re- 
sults from sampling in Fig. [T] and [2] can be brought into 
nice agreement with the original simulation results when 
a new interpretation is offered in Section |rv[ 



III. POWER SPECTRUM FROM 
PERTURBATION THEORY: LOCAL MODEL 



It is useful to compare the numerical power spectrum 
we obtained through sampling against those obtained 
from standard perturbation theory [37] . We shall use 
the standard perturbative expansion with smoothing [21 
because it is most directly related to our procedures. 



From the numerical results in Section II B it is already 
clear that they are qualitatively similar to the well-known 
results that when higher order loop corrections are in- 
cluded, they generally give rise to significant corrections 
even to large scales, unless the smoothing scales are suf- 
ficiently large [2"T] . There is an alternative procedure 
making use of the idea of renormalization to write these 
results in a different way |22j . However, after renormal- 
ization, the bias parameters become free parameters, and 
cannot be related to other bias parameters such as those 
in the scatter plot. Our ultimate goal is to establish 
consistent links between the bias parameters obtained 
from different means, such as the scatter plot and the 
power spectrum, therefore we shall not consider the spe- 
cific scheme of renormalized bias advocated in [52] . But 
we will comment on it more in Section IIVI since it has 
distinct predictions from our scheme. 

In standard perturbation theory with smoothing, the 
nonlinear matter density field is smoothed. The pertur- 
bation series is expanded in terms of the linear matter 
power spectrum. When the next to tree-level correction 
is considered, all the f-loop correction terms should be 
included. Here we summarize the results of the pertur- 
bative expansion of the halo cross power spectrum and 
auto power spectrum up to I-loop, which will be needed 
later on. 

The perturbative expansion for the halo cross and auto 
halo power spectrum up to f-loop order can be repre- 
sented diagrammatically as in Fig. [3] and [4] respectively. 
A diagrammatic representation for the biased density 
field has been used before, see e.g. [Ill [3H1 [35] ■ In each di- 
agram, each line with arrow represents an external wave 
vector, a solid dot denotes a bias parameter vertex, and 
the branching of a solid line into wavy lines represents 
the vertex function due to the nonlinearity of gravity. 
Each wavy line represents the linear density field, and 
the merging of two wavy lines stands for a linear matter 
power spectrum. The meaning of our notation for the 
power spectrum can be illustrated by the example, P^ 2 - 
The digit in front of the comma is for the dark matter 
field, while the number of digits behind the comma de- 
notes the degree of biasing in the halo biasing prescrip- 
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tion. In our example, it means it is of the form (<5r&2<5|j), 
in particular the degree of biasing is quadratic (62) as 
there are two digits behind the comma. The numerical 
value of each digit denotes the order of each 8r in the 
dark matter perturbation theory. In the example, the 
first factor (dark matter field) is linear, while the second 
factor (the halo part) contains a linear and a second order 
dark matter field. For the halo auto power spectrum, the 
notation is similar except that we have the dark matter 
field replaced by the halo field. 

The nonlinear effect of gravity is represented by wavy 
lines, we can imagine shrinking the wavy lines, or 
"springs", to a point, and so we see that the effect of 
gravity is to generate an effective 2-point function (the 
first rows in Fig. [3] and pfl), effective connected 3-point 
function (the first two diagrams in the second row of 
Fig. [3] and the second row in Fig. [4j for the nonlinear 
density field, and a connected 4-point function (P^jj' 11 ). 
The terms with b§ gives rise to a product of power spec- 
tra. 

Here we shall consider quadratic and cubic local bi- 
asing models, i.e. i runs up to 2 and 3 respectively in 
Eq. (JlJ . Using perturbation theory, or simply reading off 
from Fig. [3] with the appropriate symmetry factors, we 
can expand the halo cross power spectrum as follows: 

P b S(k) = b?W 2 (kR)P(k) = P c y + P 2 J + P^, (5) 



P 2 c £\k)=b*W(kR) J dVa(q,k-q) 



(6) 



x W(qR)W(\k~q\R)P L (q)P h (\k-q\), 
P^ 2 (k) = 2b«P L (k)W(kR) J d 3 qF 2 (-k,q) (7) 
X W(qR)W(\k-q\R)P L (q), 
P^ U (k) = b fW 2 (kR)P L (k)a 2 R , (8) 
where the mode coupling kernel F2 is given by 



7 2 v g p' 



(9) 



with fx — p q, and o\ denotes the linear RMS fluctuation 
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d 3 qW 2 (qR)P L (q). 



(10) 



Eq. ^ consists of the three terms proportional to bf, 
corresponding to the three diagrams in the first row in 

r~l 1122 

Fig. 3 We have grouped these terms P c ^ , P c j i and 
1 3 — 

P c jj together, as these three terms can be resumed to 
one term if we use the nonlinear matter power spectrum 
P, instead of the linear Pl- To do so, we shall use the 
nonlinear matter power spectrum P measured from sim- 
ulation. Thanks to the property of the P2 kernel, Eq. §6§ 
vanishes as k goes to zero. 

If the window functions are not present, Eq. ([7]) can be 
computed analytically, and it is equal to (34/2lJbfa 2 Pi J , 
which is proportional to Pl and is still significant at large 



scales. This must be approximately true in the presence 
of the window functions. Thus this term contributes 
to the excess power at large scales when the smooth- 
ing scales are not small. Eq. ^ is also proportional to 
Pl, thus this is another contribution to large scale power 
from loop corrections. For the quadratic model, the term 
P^jj 111 vanishes by fiat. 

We can carry out a similar expansion for the halo auto 
power spectrum: 



{b*) 2 W 2 {kR)P{k) = P^ + Pl 2 + P^l) 



2b?h$W(kR) J d 3 <?F 2 (q,k-q) 



(12) 



X W(qR)W(\k - q|P)P L (q)P L (|k - q|), 
P$ > (k)=4h?b?P L (k)W(kR) J dVM-k,q) (13) 



x W(qR)W(\k 



■q\R)PL(q), 



d i qW 2 {qR)P L (q) 



(14) 



J& U1 (*) 



2 

IU 2 (|k-q|P)P L (|k-q|), 

b?b§W 2 (kR)P L (k)a 2 R . (15) 

The terms are similar to those for the cross power spec- 
trum, except that there is a new term Eq. ( 14 ) . For 
small fc, one can expand the integrand in Eq. ( 14 ) about 
k, and the leading correction to the constant term is k 2 
only. Thus this term is rather constant at large scales, 
and its effect is like a shot noise term [2TJ [22] . Again, 



we can resum the terms proportional to (&f ) 2 , P h ' R , P i 



and Pj,]j, using the nonlinear matter power spectrum. 

In order to consistently compare with the numerical 
results, we shall determine the bias parameters by fitting 
to the scatter plot between 5r and S^r. There are two 
natural ways to fit to a scatter plot. One can fit simply 
to all the data points in the scatter plot. In this way, the 
fitting is roughly weighted by the number of data points, 
and so the fitting is dominated the region in the vicinity 
of the origin. Alternatively, one can bin the points based 
on its value of Sr, and compute the mean and spread of 
<5h_R iu each bin. Fitting is made to the mean of the bins 
using the spread as the error bar. In this way, we fit to 
the global shape of the <5r — S^r distribution. A priori 
it is not clear which way is the best approach. Some- 
times, these two methods result in small but significant 
differences. Our goal here is to determine the halo power 
spectrum using the 1-point distribution determined from 
the scatter plot and the power spectum of the dark mat- 
ter. 

When the smoothing scale is large, e.g. 100 Mpc/h, 
the scatter in the scatter plot is small and the range of 5 
is narrow, and it is well fitted by a quadratic polynomial. 
In fact there is significant degeneracy between the bias 
parameters. However, we find that the ones obtained by 
fitting to all the data points directly, i.e. simply weighted 
by the number of data points, recovers the bias param- 
eter in the power spectrum most closely, which is sim- 
ply equal to bf^ for such large smoothing scale. The one 
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FIG. 3. The diagrammatic representation of the perturbative expansion of the halo cross power spectrum up to 1-loop order. 
See the text for the meaning of the notation. 
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FIG. 4. The diagrammatic representation of the perturbative expansion of the halo power spectrum up to 1-loop order. 



obtained by fitting to the global shape underestimates 
the bias parameter in the power spectrum sightly. Of 
course this does not guarantee that this is still the best 
approach in the weakly nonlinear regime. Nonetheless, 
we have checked that both methods give qualitatively 
similar results in recovering the bias parameters in the 
power spectrum in the weakly nonlinear regime. There- 
fore we adopt the method of fitting to all the data points 
in this paper. 

To eliminate the influence of the outlier points, we have 
removed the data points whose Sr falls within the largest 
or smallest 2%. InFig.[5j we show the best fit bias param- 
eters for various halo groups using five different smooth- 
ing scales. We have considered both the quadratic and 
cubic biasing models. We note that the bias parame- 
ters generally "run" as the smoothing scale varies. The 
higher the order of the bias parameter the more dras- 
tic the running it is. Similar results for the quadratic 
model have been reported in [18] . Detailed comparison 
is difficult as the halos are not exactly the same, but 



[T%] reported that 6f starts to approach as R decreases 
when R < 20Mpc//i, which is at odds with our results. 

We note that the higher the degree of the bias param- 
eter, the larger its error bar, as expected. In particular, 
the error bars of b% are so large in some cases for R > 50 
Mpc/h that it is consistent with b§ being 0. We expect 
that there is large degeneracy between the high degree 
bias parameters when R is large because for large R, a\ 
is small, the data points are in close proximity of <5/j = 
only. Thus b$ cannot be reliably measured for large R. 

In [19] , a Gaussian smoothing window instead of top- 
hat is used, and they generally do not use multiple win- 
dow sizes to measure the same halo group, so direct com- 
parison is not possible. However, we note that using a 
Gaussian window, which has a weaker tail than top-hat 
window in Fourier space, results in milder running of bias 
parameters. Ref. [19] argued that third degree biasing de- 
cription is much favoured than the quadratic one using 
the AIC information criterion. Here the large error for 
b$ for large smoothing suggests that the data is not con- 
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FIG. 5. 



The bias parameters for the quadratic and cubic biasing models as a function of the smoothing scale. The blue lines 



are for the quadratic model, while for the sake of clarity, we have used the red lines (b H and b R ) and yellow lines (b R ) for the 
cubic model. The error bars here are 2 a. The bias parameters generally run with the smoothing scale R. The lower the order 
of the bias parameter, the smaller the running. For R > 30 Mpc/h, the running of the bias parameters is generally small except 
perhaps for b R (which unfortunately becomes increasingly difficult to measure at large smoothing scales). 



straining on this additional parameter, and it may not be 
necessary. When the smoothing scale is small, the error 
bar is small, and so one may think that including b§ is 
justified. However, we will see that the presence of b§ of- 
ten causes larger discrepancies with the A^-body results 
when they are used to compute the halo power spectrum. 

With the bias parameters fitted from the scatter plot, 
we can now compute the halo cross power spectrum and 
the auto power spectrum using Eqs. ([5jj8| and Eqs. ( |11| 
15) respectively. In Fig. [6] we compare the b c obtained 
from the original simulation and the one from perturba- 
tion theory using the bias parameters from a quadratic 
local biasing model. As R increases, b c from perturbation 
theory decreases at large scales and the agreement with 
the original simulation results improves. This is in quali- 
tative agreement with that from sampling (Fig.[T]). When 
the cubic biasing model is used instead, the results are 



qualitatively similar to those from the quadratic model, 
except that for small smoothing scales, the deviations at 
large scales from the original simulation are slightly big- 
ger. Also for bh, the results are also qualitatively similar 
to the those of b c , and they are consistent with those 
from sampling method in Fig. [2] so we do not show them 
here. 

The perturbation theory and sampling "fail" to re- 
produce the original halo power spectrum in a similar 
way. Using the sampling results, we can eliminate the 
two loopholes in the standard test of local biasing pre- 
scription, the inaccuracy of fitting to the scatter plot and 
the breakdown of perturbation theory. Thus we conclude 
that the problem originates from the standard interpre- 
tation of the local biasing prescription, where there is 
no relation of the smoothing scale R to the "observation 
scale" k. 
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FIG. 6. Comparison of b c computed from the quadratic local biasing model compared to simulations (solid lines). The bias 
parameters are measured from the scatter plot and then used to make predictions, assuming a fixed smoothing scale R as 
shown. The perturbative calculation includes all the one-loop corrections in the quadratic bias model. Each curve is shown up 
to the scale at which it starts to oscillate due to the window function. Note the similarity of the "failure" of these predictions 
with the sampling results in Fig. 1. 



From the exercise in this section we have encountered 
two peculiar properties of local biasing related to the 
smoothing scale R: the scatter plot bias parameters run 
with R and the power spectrum varies with R even at 
large scales. These two properties can be accommodated 
if we interpreted the smoothing scale R as the "knob" 
that controls the scale of the fluctuations that the bias- 
ing prescriptions measure. We now discuss this in more 
detail. 



LOCAL BIASING RE-INTERPRETED 



IV. 



In Section [TTT| we have presented the standard proce- 
dure to compute the halo power spectrum using Eule- 
rian local biasing with smoothing. This procedure im- 
plicitly assumes that using the biasing relation obtained 
by smoothing the dark matter and halo fields at one scale 
R is sufficient to compute the halo power spectrum at all 
scales. However, the window function of size R filters out 
the small scale fluctuations, and so the local biasing re- 
lation gives the information mainly on the scale R. This 
is reflected from the fact that for wave number ^ R~ l , 



the window function causes rapid decay of the power. 

We have seen that in Section IIIII that there are two 
issues with the local biasing prescription related to the 
smoothing scale R. In Fig. [5] we saw that the scatter 
plot bias parameters in fact are scale-dependent. We call 
this the running of bias. In the standard approach out- 
lined previously, the running of bias is not incorporated. 
Rather, the scatter plot bias parameters are measured at 
one scale and applied to all scales. This assumes that 

vanishes. One may try to model the running of bias pa- 
rameters, for example, using the peak background split 
or peak bias formalisms [3J |3D] . A more practical way is 
to measure the scatter plot bias at various scales as in 
Fig. [5j Another would be to use a bispectrum analysis 
to get the running of bias parameters directly in Fourier 
space, but in this paper we are mostly interested in mak- 
ing connection with the literature where bias parameters 
are measured from scatter plots. 

Another issue is that at large scales the power spec- 
trum computed using the local biasing prescription de- 
pends on the smoothing scale [3T] , as evident from Fig. [I] 
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[2] and [6j This is a long-standing problem of using the lo- 
cal biasing prescription to compute the power spectrum. 

Here we argue that these two peculiar properties of 
the local biasing prescription call for a proper interpreta- 
tion of the physical significance of the smoothing scale R. 
We propose that each biasing prescription specified by R 
only enables us to calculate the Fourier space fluctuation 
at one scale corresponding to R. To our knowledge, this 
intuitive viewpoint on the physical significance of R has 
not been appreciated (see however |23j). The smoothing 
scale is generally regarded as unphysical, or one smooth- 
ing scale is sufficient. Perhaps, this is partly because the- 
oretical predictions such as the peak-background split, at 
least as it is often implemented, give only one set of bias 
parameters. 

We interpret the local bias prescription obtained with 
smoothing window of scale R as furnishing a good ef- 
fective description at that scale. As any other effective 
description, it is valid only in certain regime, and we 
interpret the large discrepancy at large scales that we 
see in computing the power spectrum as a consequence 
that we apply the prescription outside its regime of va- 
lidity. Taking this into account, we can naturally "solve" 
those two problems in the standard formulation of local 
bias, where there is no a priori connection between the 
smoothing scale R and the observation scale (k in the 
power spectrum case). 

Therefore, for each set of bias parameters measured 
from a scatter plot using a window of size R, we will 
use them to compute the power spectrum at a scale of 
roughly 1/R in Fourier space. Thus, each measurement 
of the bias parameters from a scatter plot only yields 
a data point in the power spectrum calculation, rather 
than the whole function as in the usual procedure. For 
the real-space top-hat window used in this work, which 
is rather extended in Fourier space, we have some free- 
dom in deciding the precise Fourier mode it corresponds 
to. In this paper we regard the local biasing prescription 
as a large-scale effective description. On the other hand, 
the simulation results can be regarded as the "full the- 
ory" . We can fix this degree of freedom by requiring that 
the effective description results match those of the "full 
theory" . We will see that the following mapping 



in real space, to a band power in Fourier space. That is, 
one can take a band-power window given by 



R 



(17) 



results in remarkable agreement between the sampling 
and the original simulations results. For the power spec- 
trum, the proper window to consider, is window function 
squared. It turns out that the scale k^ corresponds to the 
half- width-half-maximum of W 2 (kR). We do not have a 
deeper justification for this prescription than what was 
just stated. However, there are cases in which this issue 
can be made more precise. 

Let us now discuss two examples where the mapping 
between the smoothing scale R and the observational 
scale (k in the power spectrum) are obvious. The first 
would be to change the window function from a top-hat 



W k , Ak (q) = [9{q-k 



Ak 

-9{q-k-—)l (18) 



which extracts a shell of width Afc centering about k. 
Thus the scatter plot between 8 and <5h smoothed with 
the window Wk t Ak(q) gives the biasing prescription de- 
scribing the fluctuations of the halo density field at k. In 
this case, the mapping between the window smoothing 
scale and the observation scale in the power spectrum 
is completely fixed, as both are Fourier space quantities. 
However, the window Eq. ( 18 ) is not convenient because 



it is highly nonlocal in real space and cannot be nor- 
malized in usual way, i.e. its integral over all the space 
vanishes. The second example, is to keep the window 
a top-hat in real space, but calculate correlators in real 
space, in particular the variance of counts in cells at scale 
R. Then it's obvious as well that the mapping between 
smoothing scale R and observation scale R is one-to-one 
as well, there is no freedom at all. In other words, note 
that our "new" interpretation, when formulated in real 
space for count-in-cells statistics is nothing else that the 
standard well-known expression in Eq. |lj) for local bias 
originally put forward by Fry & Gaztanaga [S]. 

In addition, note that loop corrections when calculat- 
ing count-in-cells statistics will be suppressed at large i?, 
e.g. linear bias renormalization vanishes for large cells 
since for the variance of the halo field, 



_2 



h 2 ^r 2 
°1 a R 



bib 2 S 3 )aj i 



jS,a%, (19) 



where we have kept only terms up to quadratic bias, and 
S3 and 5*4 are the usual skewness and kurtosis param- 
eters. Clearly, as R — > 00 the loop corrections coming 
from nonlinear bias are highly suppressed compared to 
the linear term (that is, linear bias renormalization van- 
ishes), and there is no extra noise contribution coming 
from nonlinear bias (in the language of [55] "shot-noise 
renormalization" vanishes). In particular, since there is 
no extra noise generated by nonlinear bias in the variance 
of the halo field er^, there should be no extra noise in the 
low-fc halo power spectrum. Our "re-intrepretation" of 
local bias in Fourier space obeys these two properties as 
well. 

Let us now go back to address the two issues at hand 
with the standard interpretation of local bias, and how 
our prescription solves these problems. Note that be- 
cause the bias parameters are running, strictly speaking, 
we have to integrate over the bias parameters in comput- 
ing the loop corrections too. However, the bias parame- 
ters vary less rapidly than the power spectrum, and the 
window functions VF(|kfl — q|i?) in the integrand ensures 
that the dominant contribution to the integral is close 
to fcij. Thus it is a reasonable approximation to take the 
bias parameters outside the integral and assign the values 
at the scale fcfj to them. 
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FIG. 7. Comparison of the b c from the original simulation against the sampling and the perturbation results for the quadratic 
and cubic model output only at kn given by Eq. (17 1 for each smoothing scale. The sampling results, both shuffling and mean, 
agree with the original simulation within 1% (so it is difficult to differentiate them in the plot). For both the quadratic (solid 
symbols) and cubic (empty symbols) models, the 1-loop perturbation theory (circles) results generally improve the tree-level 
results (squares), except for the group IV. Our interpretation of local bias solves the problems raised in Figs. [I] and [5] 



For each smoothing scale, our interpretation is that we 
should only take the point at kn from Fig. [I] for the power 
spectrum calculated with bias parameters obtained from 
the scatter plot smoothed at R. Doing this we thus con- 
struct the plot in Fig. [7] Both the points from shuffling 
and mean almost coincide with the b c from the original 
simulation, thus solving the large-scale excessive power 
issue in Figs. [I] (and [2). Up to 0.22 h/Mpc, for the four 
different halo groups, the b c from sampling deviates from 
the one from original simulation by less than 1%. This 
result is not very sensitive to the choice of the factor in 
Eq. (fl7| ). For example, if 2 is chosen instead of 1.8 in 
Eq. ( |17[ ), the maximum deviation between the sampling 
results and the simulation is still within 2%. 

In Figure [7J we have also shown the points at kn taken 
from Fig. [6] (the quadratic model) and the cubic model, 
which we did not show explicitly before. Let's first fo- 
cus on the quadratic model. Generally by adding the 
loop corrections to the tree-level results, the agreement 
between theory and simulation indeed improves, except 
for the halo group IV at k ~ 0.2 h/Mpc. At redshift 
0.97, with loop corrections, the difference between the- 
ory and simulation is within 4% up to k = 0.11 ft,/Mpc, 



and within 10% up to k — 0.22 h/Mpc. The agreement 
deteriorates at redshift 0, and the agreement is within 
7% up to k ~ 0.11 h/Mpc. For the group IV, it is the 
only case in which the tree-level results fair better than 
the one including loop corrections. The pathological be- 
havior of the perturbation results may be correlated with 
the fact that changes sign as R decreases, from nega- 
tive to positive, for the halo group IV. Overall as redshift 
decreases, the nonlinearity is stronger, and so standard 
perturbation theory is expected to break down at smaller 
k. Note that similar levels of disagreement have been 
seen before for the perturbative calculation, e.g. see |17j . 
Here the deviations are slightly larger since our linear bias 
from the scatter plot does not agree as precisely with the 
low-fc matter-halo power spectrum bias. Of course, one 
could improve significantly the performance of the per- 
turbative calculation by fitting for the bias parameters 
that best match the simulation power spectrum measure- 
ments. That would have little to say though on the issues 
we are interested in this paper. 

Now we compare the quadratic model with the cubic 
one. In general, they are more or less similar, except that 
the cubic one is less stable and often fairs worse than the 
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quadratic model at large fc. For large smoothing scales, 
the error bar on b§ is large, so it is consistent with zero. 
From Fig. [7j because of large smoothing, its contribution 
is negligible, and so it is similar to the quadratic model. 
However, for small smoothing scales, such as 8 Mpc/h, 
from Fig. [5] we see that the error bar of is small and 
it is clearly negative. However, from Fig. [7j we see that 
the cubic model deviates further away from the simula- 
tion than the quadratic model. This is likely because 
for small smoothing scale the validity of the non-linear 
local bias expansion is very limited. Thus we prefer the 
quadratic model to the cubic one because of its simplicity 
and stability. 

Note that in our proposal there is a significant differ- 
ence on the impact of loop corrections to the low-fc halo 
power spectrum. Indeed, our interpretation predicts that 
at large scales, the bias is simply given by bf with R — > oo 
and thus with negligible loop corrections. On the other 
hand, in the literature small smoothing scales are some- 
times applied and the loop corrections are argued to be 
significant in the low-fc halo power spectrum, e.g. (HI 142) . 
The reason for this, as discussed before, is that the term 



shot noise contribution is modified to Eq. (A12|. Mass 



weighting has little effect on the low mass halo groups 
since our mass bin is quite narrow, while it increases the 
auto power spectrum and the shot noise significantly for 



in Eq. ( |14| is quite close to constant at large scales, and 

For non-Gaussian ini- 



so it resembles shot noise [211 E2] 
tial conditions of local type, similar loop corrections from 
nonlinear bias (proportional to b^crj^) can lead to even 
stronger corrections at low-fc in the standard implemen- 
tation of local bias [42]. However, as pointed out in [23] 
such results rely on using fixed smoothing scales unre- 
lated to the scale of fluctuations fc. 

At scales larger than the turnover of the halo power 
spectrum, any white noise starts to become increasingly 
important in the k — > limit. In 41 , it was argued that 
this shot noise effect maybe observable if the loop correc- 
tion is large enough. Adopting our interpretation of local 
biasing, however, at k ~ 0.005 h/Mpc the corresponding 
smoothing scale should be of a few hundred Mpc, which 
makes the loop corrections due to nonlinear bias vanish- 
ingly small. Even for 50 Mpc//i, the correction given by 
Eq. (14 1 is of the order of 10~ 3 -10~ 2 of the tree-level 
term. 



Recall that in Section [lIB| we mentioned that b^ and b c 
from sampling agree with each other at large scales when 
large smoothing scales are applied. With our interpre- 
tation of local biasing prescription, the sampling results 
indicate that b^ and b c from local biasing prescription 
agree with each other. In particular, Eq. ( 14 1 has negli- 
gible effects at large scales. We now provide additional 
tests of this property. 

We performed an alternative test by fitting a linear bias 
model to the Poisson-subtracted halo auto power spec- 
trum at large scales. Besides the standard equal weight- 
ing of all halos in a given mass bin, which is the only 
scheme that we used so far, we now also consider mass 
weighting with weight m and m 2 , where m is the mass 
of the halo. Mass weighting enables us to preferentially 
weight the more massive and hence more biased halos in 
the sample. When the halos are weighted, the Poisson 



high-mass halo groups. In Fig. 13 we show the halo auto 
power spectrum and the shot noise for each of the halo 
group. In particular for mass weighting by m 2 the shot 
noise is higher than the signal at most of scales of interest 
(k < 0.1 /i/Mpc). Also, mass weighting gives more weight 
to massive halos, which is expected to lead to larger halo 
exclusion effects. On the other hand, as the more biased 
objects are preferentially weighted, the magnitude of 62 
is expected to increase. The contribution due to Eq. ( 14) 
will increase too. However, according to our proposal, 
this term is still negligible at large scales even with mass 
weighting. 

The results are presented in Figure [8] The top panel 
shows the halo power spectrum with Poisson shot noise 
subtracted (data points), and a linear bias model b\ P 
(solid lines) , where P is the measured matter power spec- 
trum. The fit to b\ is done using data up to fc max = 

0. 03/i/Mpc, safely within the linear bias regime for all 
halo samples. 

Because we subtracted Poisson shot noise, any shot 
noise renormalization at low-fc will show up as residu- 
als from our fit. The bottom panels in Fig. [8] show the 
residuals in terms of the fractional deviations given by 
{P h -b\P)/P h . Up to fc ~ 0.004 h/Mpc, for equal weight- 
ing (m° weighting) the simulation results are consistent 
with our statement that local biasing predicts that the 
k — > power spectrum is given by the linear bias alone, 

1. e. there is no evidence for shot noise renormalization. 
These test results alone only constrain that the sum of 
the correction by Eq. ( 14 ) , the sub- Poisson effects due to 



halo exclusion and any other stochastic noise are small. 
The drop in power at large scales for mass weighting is 
consistent with the expectation that mass weighting in- 
creases the sub-Poisson effect due to halo exclusion. We 
do not see any evidence for an increase in the noise level 
expected from nonlinear bias in the "standard" interpre- 
tation of local bias as massive halos are up-weighted. If 
that contribution is there, it must always be somehow 
overcompensated by the negative noise coming from ex- 
clusion. 

An alternative look for extra noise at low-fc can be ob- 
tained, as discussed in Section [II B[ by comparing cross 
and auto bias, b c and b^. In our interpretation of local 
bias, these two should agree at low-fc modulo the effects 
of halo exclusion (which makes bh < b c ), whereas in the 
"standard interpretation" shot noise renormalization due 
to nonlinear bias would make b^ > b c (again, modulo 
halo-exclusion effects). Correcting for halo-exclusion ef- 
fects is nontrivial, the two-point function of discrete halos 
with exclusion ^ can be written as (see Appendix A2 in 

02]) 



1 + Mr) = [1 + a(r)][l + h(r)} + - J D (r) (20) 

n 
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FIG. 8. The upper panels show the measured halo power spectrum from simulations (symbols) and the best fit linear bias 
model b\P (lines) for each halo group. Three weighting schemes m k , with k = 0, 1 and 2 are shown. Mass weighting has little 
effects on the low mass groups, while it changes the power of the high mass groups. The best fit fei is obtained by fitting to 
the simulation Poisson corrected halo power spectrum only up to fc max = 0.03 /t/Mpc. The bottom panels show the residuals in 
terms of fractional deviations (P^ — b\P)/P^. Up to k ~ 0.004 ft/Mpc, for equal weighting, the simulation results are consistent 
with our prediction that the low-fc Poisson corrected halo power spectrum is well described by linear bias, with no evidence of 
shot-noise renormalization from nonlinear bias loops. For mass weighting by m and m 2 , we detect the sub-Poisson effects due 
to halo exclusion. 



where £/, is the continuous version and h is the two-point 
function of random hard-spheres. In the approximation 
we are working in cosmology, where the number den- 
sity n of halos is small (otherwise see [SHU]), we can 
approximate h(r) = — 1 for r < r e and zero otherwise, 
where the exclusion radius r e is e.g. twice the halo radius 
when dealing with equal mass objects. This means that 
h(r) = —WTn(r,r e ) where Wth is the spherical top-hat 
window function and we have for the continuous power 
spectrum [PT] . 

Ph(k) = P d (k) - — ^ + WMkr e ) 

+ Jw Tli {qr e )P h {\k-q\)d 3 q (21) 

While we see straight away that short-range exclusion 
from hard-sphere halos contributes opposite to the Pois- 
son noise from discreteness, there is an issue in that the 
correction for exclusion depends on the continuous power 



spectrum itself through the convolution term at scales 
comparable and below the exclusion radius. This term is 
the largest contribution due to exclusion (with the sub- 
dominant term WTn(kr e ) becoming just the exclusion 
volume in the low-fc limit). In [17] Ph in the convolution 
term is modeled by assuming the two-point function is 
a power-law at small scales, but this is not enough for 
our purposes as any deviations from it will result in un- 
accounted noise, precisely what we want to test. It is 
difficult to model Ph close to the exclusion radius from 
first principles, e.g. local bias by definition breaks down 
at these scales. Therefore, we won't attempt to correct 
for exclusion effects in this paper. 

We can still look at low mass halos where exclusion 
effects are negligible and see whether we detect any ex- 
tra noise, i.e. whether we see any evidence of bh > b c . 
The simulations we have presented so far have too poor 
resolution for this (the particle mass is 4.57 x 10 11 M / h) 
but we have also looked at higher-resolution simulations 
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of the same cosmology (the Carmen runs in the Las- 
Damas simulations) with particle mass 4.94 x 1O 1O M0 /h. 
In these simulations, we have studied halos in different 
mass bins, as low as 10 12 Mq/Ii, with linear bias less 
than unity. We have found that the ratio bh/b c is al- 
ways within one percent of unity at the largest scales 
we can probe, of order k = 0.006/i/Mpc. For reference, 
we quote this ratio for the halo samples used throughout 
this paper: bh/b c = 1,0.98,1.015,0.99 for halo samples 
I through IV. When splitting the same halos in different 
bins, e.g. 10 13 - 10 14 M Q //i and above 10 14 M Q /h we find 
b h /b c = 1,0.92 at z = and b h /b c = 1,0.93 at z = 1. 
Thus while there is evidence for halo exclusion for high- 
mass halos (which also shows up when mass weighting 
the wider bins), only for sample III there is very slight 
tendency for bh/b c to be above unity. However, when 
we look at the higher-resolution simulations of the same 
halos in sample III (that have between 162 and 304 par- 
ticles rather than 20 to 36 particles in the low resolution 
runs), even this slight hint of a discrepancy is reduced 
to consistent with unity. This is perhaps not surprising, 
halos with as few as 20 to 40 particles in the low resolu- 
tion run have bias factors that differ systematically from 
the same halos in the high resolution run by almost 10%, 
thus one should not trust their clustering properties at 
the percent level despite of their small statistical error 
bars. 



We consider the simple model in which the initial bias is 
local and described by the bias parameters b*, b\ and b\ 
at time t», subsequent gravitational evolution will then 
induce nonlocal bias. At time t > t*, the terms in the bi- 
asing prescription can be classified into two types: those 
present initially, i.e. the local type, and the nonlocal 
terms that are induced by subsequent evolution. Here 
we will focus on the nonlocal terms. For simplicity we 
assume that halos are conserved. When the halo con- 
servation assumption is relaxed, and some local source 
function describing galaxy/halo formation and merging 
is introduced, the structure of the induced kernels re- 
mains the same as that derived from the conserved model 
(see [24] for details). 

At t > t*, to the third order, the halo density is given 
by M 

S h = hS + ^S 2 + b -^S 3 + 72 e 2 ($v)(l + PS) (22) 
2 6 

+ 73[S3($v) + ^2($i 1) ,$2LPT)] ) 

where Qi and Qz are given by 

£ 2 ($v) = (V tJ $ v ) 2 - (V 2 $ v ) 2 , (23) 
£ 3 ($v) = (V 2 $ v ) 3 + 2Vy$vV J - fc $vVfci$v (24) 
- 3(V y $ v ) 2 V 2 $ v , 



A CASE STUDY: POWER SPECTRUM OF 
A NONLOCAL MODEL 



with $ v being the velocity potential, ^(^v, ^2LPt) is a 
variant of Qi(^w) and it is defined as 



The halo sampling method can be applied to detect 
the existence of extra nonlocal contributions to halo bias 
since the sampled halo field should result in the same 
halo density field if local bias holds. On the other hand, 
some insights can be gained by studying a specific model. 
In this section, we compute the corrections to the power 
spectrum due to the nonlocal bias induced by gravita- 
tional evolution derived in [21] (see also [25H2S])- We 
will first give the expressions for the nonlocal corrections 
to the power spectrum. We then use the estimated values 
of the nonlocal bias parameters to illustrate the impor- 
tance of these corrections to the halo power spectrum. 
Since the local model is often assumed, we then go on to 
study the effects in the bias parameters and the resultant 
power spectrum when the underlying model is nonlocal 
but a local model is assumed. 



A. Nonlocal bias contribution to the power 
spectrum 

In this section, we study the corrections to the power 
spectrum due to the nonlocal bias induced by gravita- 
tional evolution. 

We take the nonlocal bias model derived in [Mj. Let 
us first review some of the results that we will need here. 



£2($v,$2LPT) = Vy$ v Vtf$ 2 LPT ~ V 2 $ v V 2 $ 2 LPT, 

(25) 

with $2LPT being the potential arising in second order 
Lagrangian perturbation theory given by 



V 2 $2LPT = -a 2 ($v). 

The nonlocal bias parameters 72 and 73 are 



- 2 = -2( 6l _i) (1 _ 1), f. 

18 7 

73 = ' „„ - [11 + -3 

g 5 



7 

(fti - 1) 

63 



(26) 

(27) 
(28) 



where g denotes the ratio of growth factor between the 
observation time and the formation time, D/D*. We 
note that the values of these nonlocal bias parameters 
are model-dependent, but the tensorial structure of the 
nonlocal terms should be generic. It is worth comment- 
ing on smoothing when these nonlocal terms are present. 
Here we regard these nonlocal terms as extra degrees of 
freedom, in addition to density 6. Hence the smoothing 
operator applies to Q2 and G3 in the same way as to 5. 



Using the prescription in Eq. ( 22 ) , we can compute 



the correction to the power spectrum due to the nonlocal 
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bias terms. Up to 1-loop, they are given by 
H ll2 (k) = 8b ll2 P L (k)W 2 (kR) J d 3 q 
x (Mk- q , q -l)G 2 (-k,q)P L (<?) 



Ab ll2 W 2 (kR) J d 3 gP 2 (q, k - q) 



6272 



Pi 



2LPT 

&173 



x (Mq, k - q -l)P L (?)P L (|k-q|), 
(k) = 2b 2l2 W{kR) J d 3 g(/4,k- q - 1) 
x W(qR)W{\k-C l \R)P L (q)P L (\k^ 
,(*) - 2 7 2 IF 2 (fci?) | d 3 g( Mqik _ q - l) 2 

x PL^adk-ql), 
246i 73i 



(fe) = - 



7 



-TT" 2 (fcP)P L (fc) / d d q 



(29) 



(30) 



(31) 



(32) 



(33) 



Pfe l73 (fe) 



(Mq, k +q - l)(/4,q ~ 1)-Pl(<Z), 

0, 



(34) 

where /i q , p denotes q- p/(qp). The meaning of each term 
should be self-evident. There are two terms proportional 



to &172. Eq. (33) is due to the 61 term and the 73 term 



with the 2LPT potential. There is another term propor- 
tional to 6172/3, but it is equal to (61 — 1)/(26 2 )P{, 272 , so 
we do not show it explicitly here. In all the terms except 
for Eq. (31 1, the window function simply factors out. In- 



terestingly, Eq. ( 29 ) and ( 33 ) can be further simplified to 



the same functional form up to a proportionality factor 



A ^±W 2 (kR) 



dq 



P(q) 



Akq{k 2 + q 2 



x (3fc 4 - Uk 2 q 2 + 3q 4 ) + 3(k 2 - q 2 f In 



(fc- 
(k- 



qf 
q) 2 



with A = (vr/14)&i72 for P 6 \ 72 and A = (7r/14)6i 73 for 



P fc 2 i73 T From l 2 4 ( and also Ec l- f27j) and (|28j) ), we 
expect 72 to be generically negative, while 73 is positive 
and smaller in magnitude, and so these two terms partly 
cancel each other. 

In Fig. [9j we show the correction terms Eq. (29 33) 
using the bias parameters b± = 2, b 2 = 1.34, 72 = —0.5, 
and 73 = 0.1. We have normalized the correction terms 
with respect to the tree-level term P^' 1 — 6 2 Pl, where Pl 
is the linear power spectrum. At large scales, the contri- 
bution due to the nonlocal terms vanish. This is because 
the nonlocal kernels are derived from momentum conser- 
vation equations, and so they vanish when the external 
wave number goes to zero [32] . At k ~ 0.1 h/Mpc, the 
correction due to these nonlocal terms is 30 per cent com- 
pared to Pj,' • The precise contribution depends on the 
nonlocal bias parameters, which should be obtained by 
fitting to the full model. Note that in making this plot 
we have used a window size of 16 Mpc//i, but the window 
function has negligible effect in the ratios shown in the 
figure. 
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FIG. 9. The total nonlocal bias contribution to the power 
spectrum (solid line) and its individual nonlocal contributions 
(dashed lines). The corrections to the halo power spectrum 
due to nonlocal bias, are normalized with respect to the linear 
bias term using a linear power spectrum P^' 1 = 61 Pl. The 
bias parameters used are 61 = 2, 62 = 1.34, 72 = —0.5 and 
73 = 0.1. The nonlocal corrections vanish at large scales and 
start to contribute significantly at k ~ 0.1 h/Mpc. 



Figure [10] shows the ratio of the effective local model 



(eq. ( 36 ) below) power spectrum to the underlying nonlo- 
cal model. For the underlying nonlocal model, the local 



terms Eq. ( fll||T4| ) and the nonlocal terms Eq. ( 29 34 ) 
are used, and the bias parameters are the same as those 
111 Fi g. [9] T he effect ive local model includes the terms 
Eq. (11 14) and the corresponding effective local bias 
parameters are obtained using Eq. (38). As local bias- 



ing prescription is a sensible approximation only when 
the scale dependence is properly taken into account, we 
have used our mapping of scales, Eq. (17). That is, each 



k mode is computed using a different window function. 
We see from this that nonlocal bias effects have a mini- 
mal impact on power spectrum predictions in the weakly 
nonlinear regime. This prediction is in good agreement 
with our estimates for nonlocal effects in the power spec- 
trum from the sampling method below. This should be 
contrasted with Fig.[9j from which one may conclude that 
the effect of the nonlocal bias on the power spectrum is 
quite significant. This is because local biasing prescrip- 
tion constructed at the scale R yields a good effective 
description for the halo field and it can partly absorb the 
nonlocal effects. Note that without applying Eq. (17), 
the results hardly make any sense. 



B. Errors from assuming local bias 

In this subsection, we shall approach the following 
question in more detail: suppose that the underlying bi- 
asing model is given by some nonlocal prescription, such 
as Eq. (22), how biased the results would be if the lo- 



cal biasing prescription is assumed? To study this in a 
controlled manner, we construct what we call man-made 
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the local model 




10'" 

k /(h/Mpc) 



FIG. 10. The ratio of the effective local model power spec- 
trum to the underlying nonlocal model. The power spectra 
are computed upto 1-loop, and the scale dependence of local 
biasing prescription has been taken into account. Nonlocal 
bias effects have a small impact on the power spectrum. 



b?S R 



Y R ' 



(36) 



We shall see that b R is largely unaffected, while b R will 
be systematically shifted due to the presence of Gir- 

Let us first estimate the value of b R . The term Q 2 in 
real space is given by 



0a (x) = J d 3 q 1 d 3 q 2 e i ^+^ x (n 2 12 - 1) 



Because of 
£ 2 (0) only. 



x <5 L (qi)<5 L (q 2 )- 

spatial homogeneity, we can 
The average of the kernel, 



(37) 

consider 
1, 



M12 



/ d 3 9l d 3 g 2 (^ 2 - d 3 qi d 3 q 2 ), is -2/3. This 

approximation has been checked against simulations and 
found to be good for a smoothing scale of 40 Mpc/h of a 
top-hat window |24j . Thus, approximating the kernel by 
its average, we expect to have 



nonlocal halo density from the dark matter density field 
with some input bias parameters. We will compare the 
power spectrum obtained in four different ways: the nu- 
merical power spectrum computed directly from the man- 
made halo density field, the 1-loop perturbation theory 
results from the input nonlocal model, the power spec- 
trum from shuffling of the man-made halo density field, 
and the 1-loop results from the effective local model. Our 
approach is similar to Ref. [20 , in which artificial local 
models are constructed to test the importance of higher 
order terms in the bispectrum. 

As found in [24] from numerical simulations, the dom- 
inant nonlocal contribution to biasing prescription orig- 
inates from the Q 2 term, and so for simplicity, we shall 
consider the nonlocal model up to second order in 5 



:72- 



(38) 



?hR 



Mi 



1 2 Q 2 R- 



(35) 



We shall refer to this model as the Q 2 model. The 
field Sr is obtained by simply smoothing the nonlinear 
dark matter density 5 from the iV-body simulation. In 



Eq. ( 22 1 , Q 2 is constructed from the velocity potential. 
As obtaining the velocity potential is cumbersome and 
the linear power spectrum is close to the nonlinear ve- 
locity divergence power spectrum [37], we shall use the 
linear gravitational potential instead of the nonlinear ve- 
locity potential. The error committed here is third or- 
der in linear To construct G 2 r, we first compute 
Vij$ = (k i kj/k 2 )Si J ('k) in Fourier space. We then in- 
verse Fourier transform to get Vij$(x), from which we 
construct Q 2 using Q 2 = (V„ $) 2 - (V 2 $) 2 . Finally, after 
applying the smoothing window, we arrive at Q 2R . 

We will apply the shuffling technique to the man-made 
Sh R - Although we anticipate this to be equivalent to the 
local biasing procedure, we will explicitly check this by 
fitting to the scatter plot obtained from Eq. (35) with 



Note that we have ignored the impact of the smoothing 
window in this simple estimate. 

The shift of the local bias parameter b 2 depends on 
the form of the nonlocal term considered. The nonlocal 
term generated by gravitational evolution has the form 
of Q 2 , which is naturally fixed by Galilean invariance [24) . 
However, one could also shuffle some of local term am- 
plitude inside Q 2 to the local quadratic bias and have 
a nonlocal operator proportional to the quadrupole, e.g. 
(V i: ,$) 2 - 1/3(V 2 <&) 2 as in [28], in which case this nonlo- 
cal term does not shift the local quadratic bias parameter 
upon spherical averaging. 

The input bias parameters are chosen to be b\ = 2 
and b 2 = 1.34, and four different values of 72 for com- 
parison: —0.5, —0.2, 0.1 and 1, although we will only 
show plots for 72 = —0.5. In Fig. [II] we show the scat- 
ter plots for the man-made halo density field constructed 
with 72 = —0.5, which is roughly the mean value of 72 
found in [53]. Top-hat windows with R being 8, 16, 30, 
50, and 100 Mpc/h respectively are used. On top of the 
scatter plots, we have shown the best fit using the effec- 
tive local model Eq. ( 36 1 . If our estimate Eq. ( 38 ) were 
exact, we would expect b R = 2 and, for 72 = —0.5, b R to 
be 2.01. Indeed the best fit b R is close to 2 irrespective 
of the smoothing scale. On the other hand, from Fig. [TT] 
the b R estimate is accurate only for large smoothing win- 
dow. As R increases, the best fit b R increases and tends 
to the estimated value. This behavior is most likely due 
to the effect of smoothing, note that in the man-made 
halo field there is only one window function acting on 
Q 2 , and we hav e converted the shift on the quadratic 
bias in Eq. (38) ignoring this (i.e. 5 R contains two win- 
dows, not one). Also note that b R is non-zero for small 
R, this can be expected from doing a conditional average 
of Q 2 at fixed S (rather than our spherical, unconditional, 
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FIG. 11. The scatter plots from a "man-made" halo density field constructed using b\ = 2, &2 = 1-34 and 72 = —0.5. Top-hat 
window of smoothing scales 8, 16, 30, 50 and 100 Mpc//t are used. Two redshifts are considered: 0.97 (the first row) and 
(the second row). Shown on top of the scatter plot is the best fit model using local bias. The best fit 6f agrees with the input 
parameter for all the smoothing scales, while the best fit 6^ agrees with the naive estimated value 2.01 only for large smoothing 
scales. 



average) , which gives an extra contribution to Eq. ( 38 ) 



of bg = — crfj/3. We see a stronger amplitude than — 1/3, 
and this may again related to the mismatch in the num- 
ber of window functions in the conditional average argu- 
ment here. Note that the values of the bias parameters 
are not chosen carefully to ensure S^r > — 1. However, 
for a reasonably large smoothing scale, the man-made 
density obeys ShR > — 1. 

In this deterministic model, the scatter originates from 
the projection of the G2 term onto the 5h ~ 5 plane. We 
can compare the scatter due to the projection effect of 
the nonlocal terms with that in the standard scatter plots 
from simulations. If we consider halos from simulations 
with values of and b^ similar to those we considered 
here, and with the same smoothing scale, even when we 
take some large value such as 72 = 1, the scatter in the 
man-made scatter plot is still much smaller than that 
in the simulation scatter plots. Thus the scatter in the 
scatter plots from simulations cannot be solely explained 
by the projection of the nonlocal terms alone, similar to 
what is found in [21]. For other values of 72, the same 
trend in the scatter plots holds, and so we do not show 



them here. Of course, the smaller the value of I72I, the 
smaller the scatter in the man-made scatter plot. 

In Fig. |12[ we compare the power spectrum obtained 
from perturbation theory using the nonlocal model, shuf- 
fling, and perturbation theory using the effective local 
model against the numerical power spectrum from the 
man-made halo density. In Fig. |12[ 72 is again chosen 
to be —0.5. The nonlocal one-loop perturbation results 



include the local contribution Eqs. ( 11 - 14 1 and the nonlo- 



cal contribution Eqs. (30 32), computed using the input 
bias parameters. We note that Eq. (29) is not present 



because we have used the linear potential to construct 
Qi instead of the nonlinear velocity potential. The one- 
loop effective local model calculation includes only the 
local contribution Eq. (11 14), but the bias parameters 



are the best fits in the scatter plots. 

The nonlocal bias perturbative results agree very well 
with the original man-made model results for large scales 
(the first row in Fig. 12). However, as the smooth- 



ing scales decreases, the agreement deteriorates as the 
smoothing scale decreases, especially for 16 and 8 Mpc/ft,. 
The agreement is also better for z = 0.97 than for z = 0. 
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FIG. 12. Different power spectrum calculations compared to the true (original) power spectrum of the "man-made" halo 
density field constructed with 72 = —0.5 at various smoothing scales. In the first row, we show the power spectra obtained 
from 1-loop perturbation theory using the nonlocal model supplemented with the input bias parameters. In the second row, we 
show the power spectra from shuffling of the "man-made" halo field. Shown in the last row are the 1-loop perturbation theory 
results using the effective local model with the local bias parameters fitted from the scatter plots. All of them are normalized 
by the corresponding original power spectra. Two redshifts are considered: 0.97 (left column) and (right column). In each 
plot, five smoothing scales are used: 8, 16, 30, 50 and 100 Mpc//i (same as Fig. 111. Some of curves with small smoothing 
scales fall outside the range shown. Also, for each curve we show only up to the scale at which it starts to rapidly oscillate due 
to the top-hat window function. For the shuffling and the effective local model results, we have also plotted the points obtained 
using Eq. (Ff\ (squares). 



This is consistent with larger nonlinearity in the con- 
structed field when the smoothing scale is small or the 
redshift is low, and so perturbation theory is not so ac- 
curate. 



After shuffling (middle panels in Fig. 12 1, the results 
are indeed biased, especially for small smoothing scales, 
and the perturbation results using the effective local 
model (bottom panels) share qualitatively similar trends 
as the shuffling results although sometimes it is not clear- 
cut. Let us first apply the insights from the local model, 
especially the behavior of the effective bias parameters, 
to interpret the shuffling results. For the smoothing scale 
as large as 100 Mpc/h, the loop correction is negligible, 
and so its value is mainly determined by the best fit b^. 
As the smoothing scale decreases, the loop corrections 



become more important. As we saw in Section [VAJ the 
corrections due to the nonlocal term are significant only 
for k > 0.1/i/Mpc, while the local correction, especially 
Eq. ( 13 1, which is approximately proportional to Ph(k), is 



already significant at large scales. As a result of a system- 
atically enhanced due to G2 , the shuffling power spec- 
trum is higher than the original one for R < 50Mpc//i. 
However, we see that in Fig. 11 for small R such as 16 
and 8 Mpc/h as R decreases the effective &f also de- 
creases, this effect is especially significant at z = 0. This 
reduction in b§ for small R can explain why the shuf- 
fled power spectrum at z = suddenly plummets for 
R = 8Mpc//i. The 1-loop results based on the effective 
local model are consistent with the trends discussed for 
the shuffled halo power spectrum, but generally the ef- 
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fects are stronger. Again perturbation theory fairs badly 
for fields constructed using small smoothing scales or at 
low redshift. In some sense, shuffling is the best local bi- 
asing prescription can achieve because all the information 
in the whole scatter plot is used, and the power spectrum 
is computed numerically. 

We would like to clarify the relation between the man- 
made halo density field here and the halo field from sim- 
ulations. The man-made halo density are constructed at 
some scale R with the input bias parameters, while we 
have seen that scatter bias parameters of the halo density 
field run with scale. Hence we can regard the man-made 
density field as a "snapshot" of the halo density from 
simulation at scale R. This viewpoint is in line with our 
formulation of local bias. 

The nonlocal term Qi with negative 72 after the shuf- 
fling increases the effective frfs an d so the shuffling power 
spectrum is higher than the original model at large scales. 
This agrees with the trends that we saw in Section |II B| 
However, besides the contribution arising from C/2, there 
is another piece from the local 62 (in the spherical ap- 
proximation it is given by Eq. 38 1 . Because it is difficult 



to separate these two components, inspecting P(k) at 
large scales is not an effective way to detect the nonlocal 
bias. In fact, we argued in Section |IV] that the biasing 
prescription obtained with smoothing scale R, the effec- 
tive bias parameters can absorb some of the effects due 
the nonlocal terms at that scale. In the case of bispec- 
trum, which will be considered elsewhere, is expected to 
be more effecitve in detecting the nonlocality in the bi- 
asing prescription. 

So far we have discussed the behavior of these models 
in the small k region, k < 0.05 h/Mpc. According to 
our formulation, for the halo biasing model constructed 
at scale i?, we should focus on the k ~ \ jR only. 



For each curve (second and third rows in Fig. 12), we 



have also shown the point mapped using Eq. (17). Over- 



all for the five smoothing scales considered, the deviations 
of the shuffling results from the input nonlocal model are 
within 3% up to k ~ 0.2 h/Mpc. The difference is small, 
but is still slightly larger than what we found between the 
shuffling results and the original simulations. However, 
we only include the G2 term using the linear potential, 
in particular the difference between linear potential and 
the nonlinear potential may not be negligible in the non- 



linear regime. Note that Fig. 10 is quite similar to curves 
formed by the square symbols in Fig. [12] especially the 
middle panel. Thus both approaches suggest that the ef- 
fect of the nonlocal bias on the power spectrum is with a 
few per cents up to the weakly nonlinear regime. Overall, 
the good agreement supports our claim that the effective 
description constructed at the scale R is accurate at that 
scale alone. 



VI. CONCLUSIONS 

In order to test the validity of the local biasing pre- 
scription and to examine the impact of the nonlocality, 
we constructed a halo field by sampling the dark matter- 
halo distribution from simulations. The method is im- 
plemented by first constructing a scatter plot between 
Sr and S^r, and then dividing Sr into bins and shuffling 
the halos S^r within the same dark matter density bin. 
Because we randomly permute the halos within the same 
Sr bin, the halo field constructed in this way has the 
same scatter plot as the original one. This method can 
also be reduced to a mean (scatterless) biasing prescrip- 
tion by replacing the halo density by its mean in the bin. 
We then compute numerically the power spectrum of the 
halo fluctuations defined in this way. However, we find 
that although the shuffling method keeps the (one-point) 
scatter in the scatter plot, any stochastic noise (includ- 
ing shot noise) is destroyed by the shuffling process at 
the two-point level, in the sense that the noise is elim- 
inated by ensemble average when computing the power 
spectrum (see Appendix). Thus we obtain that the dif- 
ference between the shuffling and the mean methods is 
negligible at the end. 

The method developed here differs from the usual test 
of local biasing in several aspects. First the shuffling 
method keeps all the information in the scatter plot be- 
tween Sr and ShR- As we do not fit to the distribution 
by a polynomial, we essentially test the series in Eq. 
without truncation. Often, perturbation theory is in- 
voked to compute the correlators in testing local bias- 
ing. When such predictions break down, it is not clear 
if local biasing fails and/or perturbation theory breaks 
down. As we compute the power spectrum numerically, 
i.e. nonperturbatively, we can test the validity of local 
bias without worrying about the breakdown of perturba- 
tion theory. More importantly, this enables us to extend 
the test towards the nonlinear regime. 

In what has become to a large extent the standard 
interpretation of local bias in the literature [TTJ, [2TJ 
[22j l4ll 148] , loop corrections from nonlinear bias generate 
significant corrections at large scales; for the power spec- 
trum this is essentially a renormalization of the linear 
bias and a renormalization of the shot-noise. Depend- 
ing on the particular implementation, such renormaliza- 
tions are poorly determined (depending on an unknown 
smoothing scale R) or arbitrarily left as free parameters. 
This standard perturbative calculation is illustrated in 
Fig. [6j showing the dependence on R. Given our sam- 
pling method, we can perform a similar calculation (see 
Figs. [T] and [2]), again at fixed R, and check that the non- 
perturbative results are quite similar to those obtained 
perturbatively. Thus, the problem originates from the 
interpretation of local biasing itself, not in perturbation 
theory. 

Our proposal is that local bias parameters obtained 
using a smoothing scale R can only be applied to com- 
puting the halo power spectrum at scales k ~ 1/R (or, 
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more precisely, Eq. 17 1. Our calculations thus can au- 
tomatically include the running of bias parameters (see 
Fig. [5]) and give vanishing small loop corrections at low-fc 
since the smoothing scale R — > 00 as k — » 0. This brings 
back a fairly good agreement at large scales when bias 
parameters measured from a scatter plot are used and 
the calculation done perturbatively (see Fig. [7]) . For ex- 
ample, we obtain 4 % agreement up to k ~ 0.1 h/Mpc 
at z = 0.97 and 7 % agreement up to k ~ 0.1 h/Mpc at 
z = 0. A much better agreement is obtained when the 
calculation is done non-perturbatively via our sampling 
method. We stress that our interpretation of the smooth- 
ing scale in local bias is precisely the same as originally 
put forward by Fry & Gaztahaga [9] for the computation 
of counts in cells. In that case there is no contribution of 
non-linear bias loops to the variance of counts in cells in 
the large-cell limit (linear bias and shot-noise renormal- 
izations vanish). 

Under the "standard interpretation" , nonlinear bias 
loops generate a constant power at very large scales (shot- 
noise renormalization) , which in principle can overwhelm 
the primordial scale-invariant spectrum that drops with 
k as k — > 0. By contrast, our interpretation posits that 
all nonlinear bias loops are highly suppressed at low-fc as 
the loop momenta only run up to ~ k. We looked for ev- 
idence of shot noise renormalization at wavenumbers be- 
low the turn-over of the power spectrum k < 0.01 h/Mpc 
and have not found any, except for the sub-Poisson effect 
due to halo exclusion (see Figure [8J). This supports our 
proposal for how nonlinear bias loops should be treated, 
and that the halo power spectrum in the low-fc limit is 
given by linear bias with no extra contributions (apart 
from non-perturbative corrections such as the shot noise 
and halo exclusion). We also find from the sampling 
method that &h and b c agree with each other at large 
scales, which lends further support to our approach. 

To help interpret the sampling results, we also con- 
structed and artificial "man-made" halo density using the 
nonlocal bias model described in [24] , and apply the sam- 
pling technique to it. The middle panel in Fig. |12| shows 
that, after the mapping given by Eq. (17) denoted by 



square symbols, the results from shuffling (which makes 
the bias local) are very close to the results directly from 
the input model. This indicates that the nonlocality of 
bias does affect the power spectrum only very slightly, at 
the few percent level. This is also in agreement with our 
estimates from Fig. [TO] using perturbation theory. 
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Appendix A: More on Poisson shot noise and 
stochastic noise 



1. Poisson shot noise 

To derive the standard Poisson shot noise result, the 
density field is represented by a distribution of point par- 
ticles. Then the number density is given by 



"0 



W = ^<5 D (x- Xi ). 



(Al) 



The reason for the subscript will become clear in a 
moment. The covariance matrix of the number density 
field is given by 

(no(xi)n (x 2 )) = 2j(5 D (xi - Xi)<5 D (x 2 - Xj )). (A2) 

Because Dirac 5 functions are very singular when they 
step on each other, special care is taken for i = j. The 
sum on the right hand side of Eq. ( A2 ) is split into two 



parts, one with i = j and another with i ^ j. Therefore, 
Eq. ( A2 1 can be written as 



(n (xi)n (x 2 )) = n <5 D (xi- X2 )+?io[ 1 + £c(^i2)], (A3) 
where uq is the mean number density 

no =^ v 5 D (x-x i )), (A4) 

i 

and we have replaced the i 7^ j part with the continuous 
correlation function £ c as 

5^(&(xi-x i )5 D (x 2 -x J -)) =f%[l + &(xvt)]. (A5) 

Writing (no(xi)no(x2)) as nf\l + £(#12)] we arrive at 

ffcia) ^cM + ^Mxi-xa). (A6) 
n 

Upon Fourier transform, we get the familiar result 

1 1 



P(fc) = P c (fc) + 



(2tt)3 n ■ 



(A7) 



This derivation here highlights that the Poisson shot 
noise arises as long as the objects are point particles. 



1 http ://lss. phy . vanderbilt . edu/lasdamas 
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In simulations, the dark matter particles and halos are 
indeed treated as point particles. 

Now we consider the weighted number density 



j(x) = y^w,(5 D (x-Xj 



(A8) 



where Wi is some weight. A similar exercise yields 

e(^i2)=?c(x 12 ) + 3ffc(xi-x 2 ), (A9) 

where n w and are given by 

n w = ^2(wiS D (x - x.,)), (AlO) 

i 

^ = £>^ D (x- Xi )). (All) 

i 

Therefore the Poisson shot noise is now given by 



p(k) = p c (k) 



(2tt) 3 



(A12) 



When the weight is constant, we recover Eq. ( A7 ). One 



physically interesting weighting scheme is weighting by 
mass or even some power of it Mass weighting 

would allow us to select the more biased objects in the 
sample. One can show that for the Poisson model, the 
shot noise is minimized for equal weight. However, the 
halo power spectrum also increases as the more biased ha- 
los gain weight. Nonetheless wc find that the increase in 
shot noise level is generally larger than the enhancement 
due to the halo bias. Ref. [49J [50] reported mass weight- 
ing can reduce shot noise because the sub-Poisson effects 
due to halo exclusion become more significant. In Fig. 13 
we show the halo auto power spectrum with shot noise 
subtracted and the corresponding shot noise for various 
halo groups. Mass weighting m k , with k — 0, 1, and 2 
are considered. For the low mass groups (I and II) mass 
weighting has little effects on the overall power spectrum 
and its shot noise as the mass bins are quite narrow. On 
the other hand, for the high mass groups (II and IV), 
mass weighting increases the overall power spectrum and 
the shot noise is enhanced significantly. For the weight- 
ing by m 2 , the shot noise is comparable or higher than 
the signal on scales k < 0.1 /i/Mpc. 



2. Smoothing and shuffling of the stochastic noise 

We first discuss the case when the density fields are 
smoothed with some window functions. Here we denote 
the smoothing operator as R (this is the usual convo- 
lution with a top-hat window function in real space). 
Although after smoothing the particles spread out, the 
smoothing operator is constant with respect to the en- 
semble average operator, i.e. the smoothing operator 
commutes with the ensemble average operator. The 



smoothing operator leaves the original field coherent, and 
so we have 



(i?in(xi)i? 2 n(x 2 )) = ^ii? 2 (n(xi)n(x 2 )). 



(A13) 



Therefore when the fields are smoothed by some win- 
dow function the shot noise still contributes to the power 
spectrum except with the factors of the window functions 
(Eq. §). 

In addition to the mean biasing relation, we now in- 
clude a stochastic noise cr, then S^r is given by 



with 



ShR = Ir(Sr) + 

Ir^r) = (KrISr), 



(A14) 

(A15) 
(A16) 



To be general, €r can include sources of stochasticity 
other than the Poisson shot noise. The stochastic random 
field e R satisfies 



(e R ) = 0. 



(A17) 



We do not assume the two-point correlation function of 
cr to be Poisson, but we assume it to be uncorrelated 
with the dark matter density 5r. 

Eq. ( A14[ ) consists of two parts, the coherent part 
(/i?(<5i?))i whose coherent length is large or comparable to 
the smoothing scale, and the incoherent part (en), whose 
coherent length is much smaller than the smoothing scale. 
These two parts respond to smoothing differently. The 
coherent part is affected little by smoothing, while the 
incoherent part is greatly suppressed. 

When we do shuffling of the halo overdensity within the 
same bin of Sr, we essentially apply some random distri- 
bution function to the density field. For convenience, we 
represent this process by the shuffling operator S. Note 
that this shuffling operator is a stochastic operator, and 
it does not commutes with the ensemble average opera- 
tor. Thus the expectation value of the shuffled halo field 
is given by 

(£[/fl(Mxi)) + e*(xi)]£[/fl(Mxa)) + e*(x 2 )]) 
= (/ fl (Mxi))MMx 2 ))> + (Se R ( Xl )Se R ( X2 )) 

(A18) 

We have used the fact that S is designed to leave 
/fl(<5fl( x )) invariant. On the other hand, we can think of 
S as a random distribution function which distributes the 
"parts" of the particle everywhere, and so we can expect 
its contribution is negligible. Indeed this is consistent 
with the what we observe: difference between shuffling 
and sampling by the mean is negligible. 

We did a simple test for the case of Poisson white 
noise. We generate 10 6 particles in a cubic box of size 
2400 Mpc//i randomly. The particles are interpolated 
to a 240 3 grid. We have considered three interpolating 
kernel with increasing smothness: NGP, CIC and a 4 th 
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kj (h Mpc" 1 ) 



k/ (h Mpc" 1 ) 



FIG. 13. The halo power spectrum with shot noise subtracted (lines with symbols) and the corresponding shot noise (straight 
lines). Mass weighting with weight m k with k — O(blue), l(red), and 2(green) are shown for each halo group. Mass weighting 
has little effect for the low mass groups (I and III), while the power spectrum and its shot noise are significantly increased for 
the massive groups (II and IV). 



order kernel. As the order of the interpolation increases, 
each particle mass is interpolated to increasing number 
of neighbouring grid points, and so the resultant den- 
sity field is more and more nonlocal. We then divide 
by the corresponding window function in Fourier space 
to sharpen the amplitude before computing the power 
specturm. For more details on interpolating kernels, see 
[52"] . In the main text, only the 4 th order kernel is used. 
We will apply the top-hat window functions of radius 
(no smoothing), 8, 30, and 100 Mpc//i respectively. 

We will compare the original power spectrum with var- 
ious smoothing scales against the one obtained by shuf- 
fling. The shuffling preocedure is implemented as follows. 
After interpolating the particles to the grid points and 
smoothing the field with the window function in Fourier 
space, we inverse Fourier transform the field back to real 
space. The grids points are randomly partitioned into 
several parts, e.g. 4, each part has equal number of par- 
ticles. Becasue of the randomness of the partition of the 
grid points, the number of parts is irrelevant. For exam- 
ple, we have checked that using 4 parts gives the same 
results as 24 parts. This is consistent with shuffling pro- 



cedure used in the main text because we assume the pati- 
tion of the grid points is based on the local values of Sr, 
which is uncorrelated with the noise. The grid points are 
randomly permuted within the same part. The permuted 
grid points are used to compute the power specturm as 
usual, but without further applying a smoothing window. 

In Fig. |14| we show the original white noise power spec- 
trum with various smoothing scales, and the correspond- 
ing power specturm obtained by shuffling. The origi- 
nal smoothed power spectrum (solid) agrees with the ex- 
pected power Eq. ^ (dotted-dashed) rather well. The 
discrepancy we see for the low order kernels, especially 
NGP, at medium to high k regime is because dividing by 
the Fourier transform of the window function does not 
work so well for low order interpolating kernels. 

We now look at the shuffled power spectrum (dashed) . 
Although the correlation function of the original Poisson 
white noise is strictly local, both the smoothing process 
and the interpolation make the density field nonlocal. 
Random distribution of the density field values at the 
grid point in shuffling will destroy the correlation in the 
nonlocal density field. For the same interpolating func- 
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FIG. 14. The original white noise power specturm and the one with shuffling. The three columns correspond to different 
interpolating kernels: NGP, CIC and 4 th order (from left to right). Within each subplot, we show the power specturm obtained 
with different smoothing scales (blue), 8 (red), 30 (green) and 100 Mpc/ft (cyan) respectively. The original power spectrum 
is shown as data points with error bars connected with solid lines and the shuffled one as data points with error bars connected 
with dashed lines. Besides, we also show the expected white noise power spectrum using Eq. (J3j) (dotted-dashed lines). 



tion, the larger the smoothing scale, the greater the re- 
duction in power after shuffling. Similarly the higher the 
order of the interpolating kernel, the lower the power af- 
ter shuffling. 



In summary, in the numerical power spectrum com- 
puted in the usual way, the small scale stochasticity is 
preserved, while the stochastic noise is destroyed by the 
sampling method. 
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